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I. Real Party in Interest 

The real party in interest in the present appeal is Intel Corporation of Santa Clara, 
California, the assignee of the present application. 

li Related. .Appeals I t )U it c< 

There are no related appeals or interferences to appellant's knowledge thai would 
have a bearing on any decision of the Board of Patent Appeals and Interferences. 

HI- Status, o f the Claims (inde p endent, claims shown in bold) 
Claims 1-40 are pending in the application. 

Claims 1-9, 10-19, 20-27 and 28-34 stand rejected under 35 USC § 101 as 
allegedly being directed to a program per se. 
Claims 35-40 are allowed. 

Final rejection of claims 1-9, 10-19, 20-27 and 28-34 is being appealed. 
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IV. Status, of Amendment s 

An official amendment and response to a first Office Action mailed 8/17/2006 
was submitted by appellant on 2/20/2007 and was entered. A Final Office Action was 
mailed on 3/26/2007. A Notice of Appeal was transmitted on 6/26/2007, and an appeal 
ensued. 

Accordingly, the claims stand as of appelant' s response of 2/20/2007, and are 
reproduced in clean form in the Claims Appendix. 

V. .Summary,^ 

Appellant's disclosure describes methods, apparatus, and program means for 
performing parallel table lookup using SIMD (single instruction multiple data) instructions 
for performing operations on packed data, The method of one embodiment comprises 
loading a table having a set of L data elements. A determination is made whether the table 
fits into a single register, if the determination indicates that the table does, in fact, fit into 
a single register, a data lookup into the table is performed with a packed data shuffle 
operation. If the table does not fit into a single register, the table is divided into sections, 
Each of the sections is sized to fit into a single register. Packed data shuffle operations are 
then executed on these sections to look up data in the table. 

Claim 1, for example, sets forth a method comprising: loading a table having a set 
of L data elements 1 ; determining whether said table fits into a single register 2 ; performing 
a data lookup into said table with a packed data shuffle operation if said determination 



* "At block 1202. a tabic having a plurality of data elements are loaded."' (US 2004/0054879 A3 $>. 15, 
par. 122, lines 2-3; Fig. .12, 1202) 

: *. determination is mad b 1204 ;-i I lliet the table fits in a single register 
(US 2004/0054879 At p, 15, par. 122, lines 3-5; Fig. 12, 1204) 
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indicates that said table does fit into a single register 3 ; and dividing said table into a 
plurality of sections if said table does not fit into a single register, each of said sections 
sized to fit into a single register, and executing a plurality of packed data shuffle 
operations on said plurality of sections to look up data in said table 4 . 

Similarly, claim 20 sets forth an article comprising a tangible machine readable 
medium that stores a program, said program being executable by a machine*** to perform a 
method comprising: determining whether a table having a set of L data elements fits into a 
single register 2 ; performing a data lookup into said table with a packed data shuffle 



Merits im ingle reg)S ic tat kup is performed w t h . p bl 
l2Ur (US 2004/0054879 .41 p. 15, par. 122, lines 5-7: Fig. 12, 1216) 

* "One embodiment of a method for handling ovetwed tabic* divides a table into sections, cadi equal to 
the capacity of a register, and accesses each of these table sections with a skiffle instruction." (US 
2004/0054879 Al p. 12, par. 109, lines 12-16) "If the data does not fit into a single register table lookup 
is to be performed with shuffle operations for each relevant portion of the table at block 1206." (US 
2004/0054879 Al p. 15, par. 122, lines 7-9; Fig. 12, 1206) 

* "As processor technology advances, newer software code ts also being generated to run on machines with 
these processors i pt ct and demand higher performance from their computers regardless 
of the type of software being used. One such issue can tn» > i ■ kinds ol I i ind operations 
that a nil i t i j s i i 
complete based on the complexity of the operations and/or type of circuitry needed. This provides an 
opportunity to optimize the way certain cotopk < operations are executed insid be » - ssoj " (US 
2004/0054879 Al p. 1 , par. 5} "In an embodiment, the methods of the present invention are embodied in 
niachinc-exccutable instructions Hk d to cause t general-purpose or special- 
purpose processor that is programmed with the instructions to perform the steps of the present invention " 
(US 2004/0054879 Al p. 2, par. 33, lines 1-5) "Although the below examples describe instruction 
handling and distribution in the context of 'exectitkn i hts a hit « it a t ht en < luncnts of the 
present invention can be accomplished by way of software. The present invention may be provided as a 
computer program product or software which may include a machine or compu er-readabl medium ten mg 
stored thereon instatci.io.ns which .may be used to program a computer (or other electronic devices) to 
perform a process according to the present invention Such software can be stored within a memory in the 
system." (US 2004/0054879 Al p. 2, pan 34, lines 1-11) "Accordingly, the computer-readable medium 
includes am type of media mael > i ible medi i suitable for s ring o t smsttiu electronic 
instructions or information in a form readable by a machine (e.g., a computer)." (ITS 2004/0054879 Al p. 
2, par. 35, lines 1-5) 

* "Memory 120 can be a dynamic i > id< 1 -ess memory (DRAM) device, a static random access memory 
(SRAM) device, flash memoiy device, or other memory device. Memory 120 can store instructions and/or 
data represented by data signals that can be executed by the processor 102." (US 2004/0054879 A i p. 4. 
par. 51, lines 4-9, Fig, 1A, 120) "The instruction, prefetcher 22b fetches macro-instructions from memory 
and feeds them to an instruction decoder 228 which in. (urn decodes them into primitives called micro- 
instructions or micro-operations (also called micro op or uops) that the machine know how to execute The 
trace cachi 230 ta.kes decoded nop and assemb th n mo program ordci sctptcnecsor » s in the 
nop queue 234 for execution. When the trace cache 230 encounters a complex macro-instruction, the 
microcode ROM 232 provides the uops needed to oomplete the operation," (US 2004/0054879 Al p. 6, 
par. 64, lines 16-26; Fig. 2, 230) 
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operation if said determination indicates that said table does fit into a single register 5 ; and 
dividing said table into a plurality of sections if said table does not fit into a single register, 
each of said sections sized to fit into a single register, and executing a plurality of packed 
data shuffle operations on said plurality of sections to look up data in said table 4 . 

Claim 28 further sets forth an apparatus comprising: an execution unit 7 to execute a 
sequence of instructions 6 , said instructions to perform a table lookup operation, said 
instructions to cause said execution to: determine whether a table having a set of data 
elements fits into a single register 2 ; perform a data lookup into said table with a packed 
data shuffle operation if said determination indicates that said table does fit into a single 
register 3 ; and divide said table into a plurality of sections if said table does not fit into a 



' "Figure 1A is a block diagram of a computer system 100 formed with a processor 102 that includes one 
or more execution units 108 to perform a data shuffle algorithm in accordance with lite present, invention." 
(US 2004/0054879 At p. 4, par. 48, lines 1-4, Fig. 1A, 108) "Execution unit 108, including logic to 
perform integer and floating point operations, also resides in the processor .102. The processor 102 also 
includes a miciocod I ' us microcode for certain macroinstnfctions. Fortius 

embodiment, execution unit 108 includes logic to handle a packed instruction set ! 09. In one embodiment, 
the packed instruction set 109 tuc ides i p ked shuffle instruction for organizing data (is 
2004/0054879 Al p. 4, par, 50, lines 1-8, Fig, 1A, 108) "Alternate ensbodinients of an e vecution unit 108 
ii i d 1 i ■ rtroliets, cml led proa - > 1 ) <> < v s oth i x I 





" (US 2004/0054879 At p. 4, par. 


51, lines 1-3, Fig. 1 A, 108) "Processing core 159 


comprises an 


execution unit 142. a set of regisfe 


r file(s) 145, and a decoder 144. Processi ng core 159 also 


includes addit 


tonal circuitry (noi shown) which 


is not ik essan io the idersi i the i icscnt 


invention. E> 


reculion unit 142 is used for execitt 


tug instructions received by processing core 159. In 


addition to :re< 




nous, execution unit 142 can recognize instructions m 



packed instruct ion set 143 for pc» i i i o > it 1 t n ! it nat I t mutt n su 143 
n ud > fi 1 oris o pportin huffl ierai oris md i . - il > in fti h rth »ackcd ii situ lions 
(US 2004/0054879 A1 p. 5, par. 57, lines 1-12, Fig. IB, 142) "For one embodiment, SMD coprocessor 
1 6 1 comprises an execution unit 1 62 and a set o f registe r fi lets ) 1 64. One embodiment of main processor 
165 comprises a decoder 165 to recognize instructions of instruction set 163 including SIMD shuffle 
instructions for execution bv execution unit 162. (US 2004/0054879 Al p. 5-6, par. 61 , lines 1-6, Fig. 
1C, 162.) "1 ic embod \ t 170 5»> > SIMD coj '<>. 

are integrated into a single processing core 1 70 comprising an execution unit 162, a set of register file(s) 
164,. and a decoder 165 to tiecognize instructions of instruction set 163 inchid Six Tsliu.ffk 
instructions (US 2004/0054879 At p. 6, par. 63, lines 10-16, Fig. It , 162) " The execution block 21 1 
contains the execution units 212, 21.4, 216. 2.18, 220, 222, 224, where lite instructions are actually 
executed. This section includes the. register files 208. 210. that store flic integet and floating point dala 
operand values that the uucro-ittstmctions need to execute. The processor 200 of litis embodiment is 
comprised of a. number of execution units iddt s te ration unit (AGU) 212 -u A 214 fast ALU 216 
fast ALU 218, slow ALU 220. floating point ALU 222. floating point move unit 224. For this 
embodiment the iloatins p< text ution blocks 222, 224, execute fk i «, MM IMD and SSL 
operations/' (US 2004/0054879 Al p. 7, pan 69, lines 1-12, Fig. 2, 222, 224) "The data operands 510. 
520 re s I e shuffle li 530 of an e\ ioi s 1 < »c procc >r along vvttl s i l * J ictio 
f U S 2004/0054879 At p. 9, par. 80, lines 2-5, Fig. 5, 530) 
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single register, each of said sections sized to fit into a single register, and execute a 
plurality of packed data shuffle operations on said plurality of sections to look up data in 
said table 4 . 

The method of an alternative embodiment comprises loading two M-bits wide data 
portions and shuffling both in accordance with an M-bits wide mask to generate shuffled 
results, then merging selected data elements of the shuffled results to accomplish an M-bits 
wide table lookup. 

Claim 10, for example, sets forth an alternative method for table lookup 
comprising: loading data for a first M-bits wide portion 8 and data for a second M-bits wide 
portion of a table 9 ; loading an M-bits wide mask, said mask comprised of N control 
elements, each control element corresponding to a unique data element position 19 ; 
shuffling said first M-bits wide portion in accordance to said M-bits wide mask to generate 
a first shuffled result 4 '"; shuffling said second M-bits wide portion in accordance to said 
M-bits wide mask to generate a second shuffled result 4 ' 12 ; 



N "This first set of data elements >s grouped as an operand named LOW TABLE DATA 1021." (US 
2004/0054879 Al p. 13, par. 1 I I, lines 3-4, Fig. 10A, 1021) "At block 1 104. the data dements for a first 
portion Of a table or a first data set is loaded." (US 2004/0054879 At p. 14, par. 120, lines 10-12, Fig, 11, 
1104) 

9 "This second set of data elements is grouped as an operand named HIGH TABLE DATA 105 J ." (US 
2004/0054879 Al p. 13, par. 11 L lines 10-18, Fig. 10B, 1051) "Data elements for a second portion of a 
tabic or a second data set is loaded at block 1108. " (US 2004/0054879 Al p. 14, par. 120, lines 14-15, 
Fig. 11. 1108) 

10 "MASK 100! and LOW TABLE DATA .1021 are eaeii comprised of sixteen elements in this example."' 
(US 2004/0054879 Al p. 13, par. Ill, lines 4-6, Figs. 10A-B, 1001) -Because the same set of masks 
1001 were used with both the LOW TABLE DATA 1021 and HIGH TABLE DATA 1051. their respective 
resultants 1041, 1042, appear to have similarly positioned data, but from different source data."' (US 
2004/0054879 Al p, 13, par. 1 12, lines 1-5, Figs. 10A-B, 1001) "At block 1102. a set of shuffle masks 
des i i I k pattet s received (US 2004/0054879 Al p. 14, par. 120, lines 6-8, Fig. .11,1 102) 
Jl "' A shuffle operation of MASK 1001 and LOW TABLE DATA 102! yields a resultant TEMP 
RESULTANT A 1041." (US 2004/0054879 Al p. 13, par. Ill, lines 6-8, Fig. 10A, 1041) "The first 
ponton data elements are shuffled in accordance to the shuffle pattern of block 1 102 at block 1106." (US 
2004/0054879 At p. 14, par. 120, lines 12-14, Fig. 11, 1106) 

n "A skiffle operation of MASK 1001 and HIGH TABLE DATA 1051 yields a resultant TEMP 
RESULTANT B 1042."" (US 2004/0054879 Al p. 13, par. Ill, lines 19-21, Fig. 10B, 1042) The second 
portion data elements are shuffled in accordance to the shuffle pattern of block 1 102 at block 1 110." (US 
2004/0054879 A l p. 14, par. 1.20, lines 15-17, Fig. 1 1, 1 110) 
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and merging selected data elements from said first and second shuffled results to obtain an 
M~bits wide table lookup resultant 1- *. 



n "Fig. 10H illustrates ihe merging of ihe selected da ton th I si dai c im the second date set. A 
packed logical OR operation is performed on SELECTED LOW TABLE DATA IW9 and SELECTED 
HIGH TABLE DATA 1050 to obtain MERGED SELECTED TABLE DATA 1070, winch is the desired 
resultant of die p illel table 1 n > nitfmi in this, example In an alternative embodiment, a packed 
KlditKMiopemtiontoaddtngahu Milt Hi'lOW i -\1H I i)\IU04<> inci SELEf LED HIGH 
TABLE DATA 1050 can also yield MERGED SELECTED TABLE DATA 1070." CCS 2004/0054879 AT 
p. 14, par. 118, tines 1-11, Fig. 10C-H, 1070) 

"At block 1112, table selects are filtered out from the shuffle masks. The table selects of this embodiment 
involve the so ram At block 

1 1 14. a table select mask is generated for the shuffled data from the first portion of the table A table select 
mask is general i for tin huft led d ita f rem the second portion of the table at block J 1 1(». These table 
select masks are to filter out the desired shuffled data elements for specific data element positions from the 
appropriate table data source. 

At block .1118. data elements arc selected from the shuffled data of ihe first table portion in accordance 
with a table select mask of block 1 114 for the first table portion. Data elements arc selected at block 1120 
from the shuffled data of the second table portion in accordance with the table select mask of block 1116 
for the second table portion. The shuffled data elements selected from the first table portion at block 1118 
and from the second table portion at block 1120 are merged together at block ! 122 to obtain merged table 
data, fhemeu i tabl t.a >f i i< ml >dnn t mcltidt da ek tent.' huffled from both the .first table 
data and the second table data. For another embodiment the merged table data can include data looked up 
front more than two table sources or memory regions.' - (US 2004/0054879 At p. 14-15, par. 120, line 17 
through* par. 121 line 14, Fig 11,1 112-1122)" 
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VL Grounds of Rej ecti on to be Reviewed on Appeal 

A. Claims 1-9, 10-19, 20-27 and 28-34 stand rejected under 35 (JSC § 101 as 
allegedly being directed to a program per se. 

VII Argument 

A. 35 U.S.C. § 10 1 Rf JUTIONS 

Claims 1-9, 10-19, 20-27 and 28-34 stand rejected under 35 (JSC § 10 J as 
allegedly being directed to a program per se. The Final Office Action (p. 2, lines 18-20) 
states that since the claimed invention can be accomplished by way of software, it reads 
on a program per se. 

Appellant respectfully submits that, if the examiner's assertion were true, then 
any process that could be automated through programmed machines would be non- 
statutory as a program per se. But it may very well be the case that any process which 
can be listed as steps in a claim can in fact be automated by programmed machines. 
Therefore, according to the examiner's assertion, any process which can be listed as steps 
in a claim is not patentable. 

To this point, the Federal Circuit explained in ATS. TCorp, v. Excel 
Communicatiom, Inc. 172 F.3d 1352, 1356, 50USPQZD 1447, 1450 (Fed. Cir. 1999) 
that (emphasis added ): 

El y point 1 [ t i it electronic, chemical 

mechanical h < ' »■ in the i < ise of the term \. (ah Street Bank A 

n hh . 14<> F.3d 1368, 1374-75. 47 USPQld 1596, 1602 

(Fed. Cir. 1998). 
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Yet. 35 U S C. §101 clearly sets forth that (emphasis added), 'Whoever invents or 
discovers any new and useful process , machine, manufacture, or composition of matter, 
or any new and useftil improvement thereof, ma.y.ob in tt.ent.thj t ubject to the 
conditions and requirements of this title." 

Thus, contrary to the examiner's assertion, the intent of such broad coverage for 
"any new and useful process" does not prohibit some particular new and useful processes 
simply because they can be performed by, or make use of a computer program. 

Therefore, appellant respectfully submits that the Examiner is in error for 

incorrectly concluding that the processes of claims 1 and 10, the executable program on a 

tangible machine readable medium of claim 20, and the programmed apparatus of claim 

28 are each directed to programs per se (i.e. programs of, in or by themselves, their 

functionally descriptive material standing alone without reference to additional means 

which would allow said functionality to be realized). Indeed, the MPEP §2106.01 further 

states that (emphasis added): 

Computer programs are often rcc d as part of a claim L StMO personnel should t 

<! i i , i ■ > i . { it ) >j i if i t t i n i.u 1 1 icl.i t 

orinacjiiiKj . In such a case, the claim remains statutory irrespective of the fact that a computer 
program is included in the claim. The Mine result occurs when a computer program is used in 
a computerized process where the computer executes the instructions set forth in the 
computer program. Oniv wheni he e l t Ijrec led to a mere 

i isl.ii , , .i , t. t , , , i« 1 < t v i , I <: ;n 

hence nonstatutory. 

Since the instant claims are not directed to a mere program listing but rather to 
otherwise statutory processes, articles and apparatus, the question at hand, is whether 
such processes are "new and useful." 

An analysis of the instant claims must be performed in order to make a 
determination of whether the subject matter is statutory. Such analysis should correlate 

42P15764 -10- 



each claim element with corresponding structures, materials or acts set forth in the 

specification. 

Claim 1, for example, sets forth: 

I. (Original > A method comprising; 

loading a tabic having a scl of L data elements: determining whether said table fits into a 
smgle register 

performing a data lookup into said tabic with a packed data siiuffi< ope rati m si id 
determination indicates that said fable does fit into a single register, and 

dividing said table into a plurality of sections if said tabic does not lit into a single 
register * el t id se tons sized to fit into a single register, and executing a plurality of 
packed data shuttle operations on said plurality of sections to look up data in said fable. 



The Federal Circuit makes it clear that the ordinary and customary meaning of a 
claim term is the meaning that the term would have to a person of ordinary skill in the art 
in question at the time of the invention. "The person of ordinary skill in the art is deemed 
to read the claim term not only in the context of the particular cl aim in which the 
disputed term appears, but in the context of the entire patent, including the specification." 
PhMipsv. AWHCorp., 415F.3dat 1313. 

Appellant respectfully submits that the instant language, when correlated with the 

corresponding structures and processes set forth in the specification (e.g. see Figures 

1 OA-H, 1. 1 and .1 2, paragraphs 1 10-122) makes it. apparent to one of skill in the art. that 

the claimed invention has practical applications in the technical arts, i.e. to order data 

from, small lookup tables such as may be used in video and encryption applications. 

In addition. Applicant respectfully submits, that the present application clearly 

asserts such a practical application in the technical arts. 

For example, paragraph 39 of the specification (emphasis added) asserts that: 

Embodiments of the present invention provide a way to implement a packed byte shnifle 
instruction with a flush to zero capability as an algorithm that makes use of S.IMD related 
utrd For one enibod i i i < d 1 s 1 I on fh > i tun < ,f i 
apaitiuil; egiste or memon location based i lues o! ntrol n each data 
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element position. Embodiments of a packed byie shuffle can be used to reduce the number of 
instructions required m many < t< ppl ions that i rat tta j> d byte shuffle 
instruction can i I' sed for any application with unaligned loads Embodiments of this 
shuffle instruction can be used ibr filtering to arrange data for efficient nmltipl v-accainiulaie 
operations. Similarly a packed shuffl m si. met a L mb used n ideo and i ^npton 

i < l.t.l I i h h ] i i i (, i to mis 

da! iro mootini registet fhus cm! dimentsol , eked shuffle with a flush to zero 

• ^ 1 Igoritbm in cordai < th the present invention c nbe ipleme ted in a 
processor to snppoi ilMI) o{ uku efficient - ithout seriously oi mumsing overall 
performance. 

Paragraph 45 of the specification (emphasis added) asserts that: 

Similarly . the reversing of all lite bytes in a 128 bit register, such as in changing between big 
endian and little endian formats, can be easily performed with a single packed shuffle 
instruction. Whereas even these fairly simple patterns require a number of instructions if a 
packed shuffle instruction were not. used, complex or random patterns require even more 
inefficient instruction routines. The most straight forward sohit.io.ft to rearrange random bytes 
in a SIMD register is io write them to a buffer and then use integer byte ready writes to 
rearrange them and read them back into a SIMD register. AH these data processing would 
require a lengthy code sequence, while a single packed shuffle instructions can suffice. By; 
reducing the numb,: id hi ■ . m 1 - ' 'Ik.- number of clock c\ clcs needed to produce 

he a > < I i l i, u x bodmu 1 j , ui u <. t n ! i < . i« ' 

. . - - ^ '- i nlc \ nines in a tabic \\\\h a ^ ■ - i in t he case 

whet tht LtaWcjsiwrecJk >g;coi ij;egistei al; /•' tings in ... ;;ord mc .\ tth die \ra sent 

> > in t foraa to data el eng Ltjifasfen i i t < i ! i t.pu 
iirsiriiclton is ys tl mi gei operations 

Paragraphs .108-109 of the specification (emphasis added) assert that: 

Currently, table lookups using integer instructions requires a large number of instructions. An 
even greater tin tube t of insf motions are needed per lookup if integer operations arc used to 
access data for algorithms implemented with SIMD instructions. But by i 1 1 

LpacMd byt hiLffieiustn i lermsimcPon > mi n< t lioi it is drasf 11 
dri 1 . For j n i u > > bytes can be a ;se; durm ijabk < j > ih i u_ I 
mstnraton il ik I bh sa<e is M\feai Ig tes oi k>s Ik- n si\in msimcliojiscanbe used to 

table daia if thejaj i:w yewycon s r....a aa:i; • 1 s. Iy\y;igyjl;!rec 
•> 1 ., « ut i .it etygagntimi i mci f i . yen 

There are some applications with data parallelism thai cannot be implemented with SIMD 
instructions due to sheir use of lookup tables f he i i m n id 1 mg algorithms ol 

pi i i i algoritJun thai uses small j 

abies thai may not fit inlo a i?.S bif register. In some cases, the lookup tables used by these 
mis ai u tl! If diL . ft hi ii i it. ible lookup op< i n b 

iccoii (Mt d„ shuffle, jristruci ul i i i 

ilte table esc i truction 

can si - nk »' 1 tlffl) O nlMj) 'i' 1 t i) , i< t • ; s 1 

i i" 1 ' 1 1 i i , i t icii eq ( i ! nd 

of these t seep hi 1 i i i ' i 11 i ion uses ti 1 - i 

> i > i < i i ' t t. ( i resu i i 'pi! 

tmplemenfcd m these isus with (he g tehed by fe slmiile msf rtUion thus gei uniting the use 
>} SJMD i u i f npi .,i ',. i j i t 1 mbodif ^ Ik pa.su 

tin cut too can kip impto\v peifui mane i. aid t wditue the iiiinkvi oi muiion ui.ssss w s ik sded 
i ' i ithi ( t maJfJ fag ibises )t5iei dm i jx ih.ac of.rniijtig 
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t > t , I enjetil n 51 1 mctipo , ! t s to ' 

O 11 «, I | 1 • ' t (puiJIiis JtiU NiMl) UlMfik i ip in f ! i 

I km 1 tip i, I i ,! . 1 ! i ,» ! >» ml ,l I. s 

embodiment of the present invention demonstrates how to access data from a table fiat 
requires memory space larger than a single register. In tins example the registers contain 
different segments of {lie table 

Paragraph 147 of the specification (emphasis added) asserts that: 

L ) i , si ) ii is icked.shuiT! oel.i , < j I i mi i » t> oj 

invention can also improve prose-- a.J - -a •:.> '< n .i« " a a ;v. --iti hardware 

it i mck \ continues to mtprmt. embodiments of the present .tn.ve.ot.too 



i:u!s,.c ive.;m.cyc(i. ij .a , > . . , >er[onnan i < n, 

• i ij e instruct! n having byte ^tnmimtx indaflushto 

zero option can have different and greater impact across processor generations. Siniply 
adding more resources in modem processor architectures alone does not ^.antntce ■ e 
!?£jfonmmeeJm^ 




Thus the specification makes it readily apparent to one of skill in the art that the 
claimed invention has a practical application in the technical arts. 

The Supreme Court held that the focus in any statutory subject matter analysis be 
on the claim as a whole, stating "When a claim containing a mathematical formula 
implements or applies that formula in a structure or process which, when considered as a 
whole, is performing a function which the patent laws were designed to protect (e.g., 
transforming or reducing an article to a different state or thing, then the claim satisfies the 
requirements of § 101." In re AlappaL 33 F.3d 1526, 1 543 (Fed. Cir. 1994) (quoting 
Diehr, 450 U.S. at 192, 209 USPQ at 10). 

This notion is sometimes phrased in terms of requiring a transformation or 
reduction of 'subject matter; In Schroder, the phrase 'subject matter' was determined not 
to be limited to tangible articles or objects, but includes intangible subject matter, such as 
data or signals (e.g. encrypted and/or video communication), representative of or 
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constituting physical activity or objects. Schroder, 22 F.3d at 295, 30 USPQ2D (BNA) at 
1459. 

Thus appeiiant respectfully submits that Claims 1-34 are directed to statutory 
subject matter. 

Conclusion 

Appellant submits that ail claims now pending are in condition for allowance. 
Such action is earnestly solicited at the earliest possible date. If there is a deficiency in 
fees, please, charge our Deposit Acct. No. 50-0221. 

Respectfully submitted, 



Date: August 27, 2007 



INTEL CORPORATION 
c/o INTE ELEVATE LLP 
P.O. Box 52050 
Minneapolis, MN 55402 
(408) 765-2194 



/$/Lawrence M. Me nnemeier/ 
Lawrence M. Mennemeier 
Reg. No. 51,003 
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VIII Claims. Appendix; Claims All owed and fnyolyed.in.A^ 

1 . (Original) A method comprising: 

loading a table having a set of L data elements; determining whether said table fits 
into a single register; 

performing a data lookup into said table with a packed data shuffle operation if said 
determination indicates that said table does fit into a single register; and 

dividing said table into a plurality of sections if said table does not fit into a single 
register, each of said sections sized to fit into a single register, and executing a plurality 
of packed data shuffle operations on said plurality of sections to look up data in said 
table. 

2. (Original) The method of claim I further comprising loading a lookup mask for each 
packed data shuffle operation, said lookup mask to indicate which data elements are to be 
extracted from said table. 

3. (Original) The method of claim 2 wherein said lookup mask is comprised of L 
shuffle masks, each shuffle mask corresponding to a unique data element position. 

4 (Original) The method of claim 3 wherein each shuffle mask is comprised of: a flush 
to zero field, said flush to zero field to indicate whether a data element position 
associated with this shuffle mask is to be filled with a zero value; a selection field, said 
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selection field to indicate which table data element to shuffle daks from; and a source 
select field, said source select field to indicate which of said plurality of table sections to 
shuffle data from for this shuffle mask. 

5. (Original) The method of claim 2 further comprising merging shuffle results from 
said plurality of packed data shuffle operations into a single register. 

6. (Original) The .method of claim 3 wherein each packed shuffle operation comprises: 
for each shuffle mask, shuffling data from a data element designated by said shuffle mask 
to an associated resultant data element position if its flush to zero fieid is not set and 
placing a zero into said associated resultant data element position if its flush to zero field 
is not set. 

7. (Original) The method of claim 6 wherein a capacity of a single register is 128 bits. 

8. (Original) The method of claim 7 wherein each data element is a byte wide and each 
shuffle mask is a byte wide. 

9. (Original) The method of claim $ wherein said lookup mask is 128 bits long and L is 
less than seventeen. 

10. (Original) A method for table lookup comprising: 

loading data for a first M-bits wide portion and data for a second M-bits wide portion 
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of a table, loading an M-bits wide mask, said mask comprised of N control elements, 
each control element corresponding to a unique data element position; 

shuffling said first M-bits wide portion in accordance to said M-bits wide mask to 
generate a first shuffled result; 

shuffling said second M-bits wide portion in accordance to said M-bits wide mask to 
generate a second shuffled result; 

merging selected data elements from said first and second shuffled results to obtain 
a.n M-bits wide table lookup resultant. 

1 1. (Original.) The method of claim 10 wherein said table and said portions of said table 
are comprised of packed data elements. 

12. (Original) The method of claim 1 1 wherein said first M-bits wide portion, said 
second M-bits wide portion, and said M-bits wide table lookup resultant are each 
comprised of N packed elements. 

13. (Original) The method of claim 1 1 wherein M is 128 and N is 16. 

14. (Original) The method of claim 12 wherein each control element is comprised of: a 
flush to zero field, said flush to zero field to indicate whether a data element position 
associated with this control element is to be filled with a zero value; a selection field, said 
selection field to indicate which table data element to shuffle data bom; and a source 
select field, said source select field to indicate which of said plurality of table sections to 
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shuffle data from for this control element. 

15. (Original) The method of claim 10 further comprising generating a table select mask 
from M-bits wide mask, said table select mask to indicate which table section each 
resultant data element position should receive data from. 

16. (Original ) The method of claim 15 further comprising: applying said table select 
mask to said first shuffled result, wherein a first shuffled data element is selected from 
said first shuffled result; and applying said table select mask to said second shuffled 
result, wherein a second shuffled data element is selected from said second shuffled 
result. 

17. (Original) The method of claim 16 wherein said merging selected data elements 
comprises merging data from said first shuffled data element and said second shuffled 
data element into said M-bits wide table lookup resultant, data from said first and data 
from said second shuffled data elements are to each occupy a separate data element 
position. 

18. (Original) The method of claim 10 further compri sing determining whether said tabic 
for said table lookup can fit into a single register, where if true, performing said table 
lookup with a shuffle operation on said table with said M-bits wide mask instead of 
performing lookups on multiple portions of said table. 
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j 9. (Original) The method of claim 18 wherein said single register is a 1 28 bit wide 
single instruction multiple data register, M less than 129, and said table is less than 129 
bits wide, 

20. (Previously Presented) An article comprising a tangible machine readable medium 
that stores a program, said program being executable by a machine to perform a method 
comprising: 

determining whether a table having a set of L data elements fits into a single register; 

performing a data lookup into said table with a packed data shuffle operation if said 
determination indicates that said table does lit into a single register; and 

dividing said table into a plurality of sections if said table does not fit into a single 
register, each of said sections sized to tit into a single register, and executing a plurality 
of packed data shuffle operations on said plurality of sections to look up data in said 
table, 

21. (Original ) The article of claim 20 wherein said method further comprises loading a 
lookup mask for each packed data shuffle operation, said lookup mask to indicate which 
data elements are to be extracted from said table. 

22. (Original) The article of claim 21 wherein said lookup mask is comprised of i, 
shuffle masks, each shuffle mask corresponding to a unique data element position. 
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The article of claim 22 wherein each shuffle mask is comprised of: a flush 
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to zero field, said flush to zero field to indicate whether a data element position 
associated with this shuffle mask is to he filled with a zero value; a selection field, said 
selection field to indicate which table data element to shuffle data from; and a source 
select field, said source select field to indicate which of said plurality of table sections to 
shuffle data from for this shuffle mask. 

24. (Original) The article of claim 20 wherein said program further comprises merging 
shuffle results from said plurality of packed data shuffle operations into a single 
instruction multiple data register. 

25. (Original) The article of claim 23 wherein each packed shuffle operation comprises: 
for each shuffle mask, shuffling data from a data element designated by said shuffle mask 
to an associated resultant data element position if its flush to zero field is not set and 
placing a zero into said associated resultant data element position if its flush to zero field 
is not set. 

26. (Original) The article of claim 25 wherein each data element is a byte wide and each 
shuffle mask is a byte wide 

27. (Original) The article of claim 26 wherein said single register has a capacity of 128 
bits and L is less than seventeen. 

28. (Original) An apparatus comprising: 
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an execution unit to execute a sequence of instructions, said instructions to perform a 
table lookup operation, said instructions io cause said execution to: 

determine whether a table having a set of data elements fits into a single regi ster; 

perform a data lookup into said table with a packed data shuffl e operation if said 
determination indicates that said table does tit into a single register; and 

divide said table into a plurality of sections if said table does not fit into a single 
register, each of said sections sized to fit into a single register, and execute a plurality of 
packed data shuffle operations on said plurality of sections to look up data in said table. 

29. (Original) The apparatus of claim 28 wherein said instructions are to further cause 
said execution unit to load a lookup mask for each packed data shuffle operation, said 
lookup mask io indicate which data elements are to be extracted from said table. 

30. (Original) The apparatus of claim 29 wherein said lookup mask is comprised of a 
plurality of shuffle masks, each shuffle mask corresponding to a unique data element 
position. 

31. (Original ) The apparatus of claim 30 wherein each shuffle mask is comprised of: a 
flush to zero field, said flush to zero field to indicate whether a data element position 
associated with this shuffle mask is to be filled with a zero value; a selection field, said 
selection field to indicate which table data element to shuffle data from; and a source 
select field, said source select Held to indicate which of said plurality of table sections to 
shuffle data from for this shuffle mask. 
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32, (Original) The apparatus of claim 31 wherein said execution is to comprises merging 
shuffle results from said plurality of packed data shuffle operations and to store said 
merged shuffle results into a single instruction multiple data register. 

33, (Original) The apparatus of claim 31 wherein each packed shuffle operation 
comprises, for each shuffle mask, shuffling data from a data element designated by said 
shuffle mask to an associated resultant data element position if its flush to zero field is 
not set and placing a zero into said associated resultant data element, position if its flush 
to zero field is not set. 

34. (Original.) The apparatus of claim 33 wherein each data element is a byte wide and 
each shuffle mask is a byte wide. 

35. (Original) A system comprising: 

a memory to store data and instructions; a processor coupled to said memory on a 
bus, said processor operable to perform instructions for a table lookup algorithm, said 
processor com pri si ng : 

a bus unit to receive a sequence of i nstructions .from said memory; 

an execution unit coupled to said bus unit, said execution unit to execute said 
sequence, said sequence to cause said execution unit to. 

determine whether a table having a set of data elements fits into a single register; 
perform a data lookup into said table with a packed data shuffle operation if said 
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determination indicates that said table does fit into a single register; and 

divide said table into a plurality of sections if said table does not tit into a single 
register, each of said sections sized to fit into a single register, and execute a plurality of 
packed data shuffle operations on said plurality of sections to look up data in said table. 

36. (Original) The system of claim 35 wherein said instructions are to further cause said 
execution unit to load a lookup mask for each packed data shuffle operation, said lookup 
mask to indicate which data elements are to he extracted from said table. 

37. (Original) The system of claim i>t> wherein said lookup mask is comprised of a 
plurality of shuffle masks, each shuffle mask corresponding to a unique data element 
position, and wherein each shuffle mask is comprised of; a flush to zero field, said flush 
to zero field to indicate whether a data element, position associated with this shuffle mask 
is to be filled with a zero value; a selection field, said selection field to indicate which 
table data element to shuffle data from; and a source select field, said source select field 
to indicate which of said plurality of table sections to shuffle data from for this shuffle 
mask. 

38. (Original.) The system of claim 37 wherein each packed shuffle operation comprises, 
for each shuffle mask, shuffling data from a data element designated by said shuffle mask- 
to an associated resultant data element position if its flush to zero field is not set and 
placing a zero into said associated resultant data element position if its flush to zero field 
is not set. 
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39. (Previously Presented) The system of claim 38 wherein said execution unit is to 
merge shuffle results from said plurality of packed data shuffle operations and to store 
said merged shuffle results into a single instruction multiple data register, 

40. (Original) The system of claim 39 wherein each data element is a byte wide and each 
shuffle mask is a byte wide. 
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IX. Evidence Appendix; With Copies of Evidence Relied Upon by Appellant 
Appellant relies upon no additional evidence in this appeal 
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X. Related rocei nas appendix Copies of Dec is ons Rendi n d b\ a C oust 01 the 
Board in any Prior Pen 

There are no related appeals or interferences to appellant's knowledge that would 
have a bearing on any decision of the Board of Patent Appeals and i nterferences. 
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